Application of Convolutional Neural Network for Image Classification on Pascal VOC Challenge 2012 dataset

نویسنده

Suyash Shetty

چکیده

In this project we work on creating a model to classify images for the Pascal VOC Challenge 2012. We use convolutional neural networks trained on a single GPU instance provided by Amazon via their cloud service Amazon Web Services (AWS) to classify images in the Pascal VOC 2012 data set. We train multiple convolutional neural network models and finally settle on the best model which produced a validation accuracy of 85.6% and a testing accuracy of 85.24%. 1. Introduction Using convolutional neural networks (CNN) for image classification has become the de facto standard largely due to the success of Krizhevsky et al. [1], Szegedy et al. [2], Simonyan and Zisserman [3] and especially He et al. [4], which is now the stateofthe art architecture for image classification. By stacking convolutional layers on top of each other, amongst other architectural artifacts, highly accurate models for task of image classification, detection and segmentation have been discover. These models have been able to nearly match, if not exceed human performance on certain datasets. CNNs are not without their own shortcomings though. Due to the sheer size of networks and the millions of parameters to be optimized, the reliance of highperformance systems increases dramatically. Though modern GPUbased systems are capable of perfoming the intensive computations required by a convolutional neural network, implementing a CNN is unsuitable for those without access to such a high performance system. Also, more complex architectures such as those by [4] take up to 23 weeks to train which demonstrate the inherent difficulties in training deep convolutional neural networks. However, given the fact that CNNs provide the best results compared to other image classification models and also motivated by the credibility of the performance of CNNs by [1]–[4], we train multiple

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Document Image Features With SqueezeNet Convolutional Neural Network

The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...

متن کامل

A Radon-based Convolutional Neural Network for Medical Image Retrieval

Image classification and retrieval systems have gained more attention because of easier access to high-tech medical imaging. However, the lack of availability of large-scaled balanced labelled data in medicine is still a challenge. Simplicity, practicality, efficiency, and effectiveness are the main targets in medical domain. To achieve these goals, Radon transformation, which is a well-known t...

متن کامل

Generic Object Detection with Dense Neural Patterns and Regionlets

This paper addresses the challenge of establishing a bridge between deep convolutional neural networks and conventional object detection frameworks for accurate and efficient generic object detection. We introduce Dense Neural Patterns, short for DNPs, which are dense local features derived from discriminatively trained deep convolutional neural networks. DNPs can be easily plugged into convent...

متن کامل

A Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images

Convolutional neural network is one of the effective methods for classifying images that performs learning using convolutional, pooling and fully-connected layers. All kinds of noise disrupt the operation of this network. Noise images reduce classification accuracy and increase convolutional neural network training time. Noise is an unwanted signal that destroys the original signal. Noise chang...

متن کامل

Fully Connected Deep Structured Networks

Convolutional neural networks with many layers have recently been shown to achieve excellent results on many high-level tasks such as image classification, object detection and more recently also semantic segmentation. Particularly for semantic segmentation, a two-stage procedure is often employed. Hereby, convolutional networks are trained to provide good local pixel-wise features for the seco...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1607.03785 شماره

صفحات -

تاریخ انتشار 2016

Application of Convolutional Neural Network for Image Classification on Pascal VOC Challenge 2012 dataset

نویسنده

چکیده

منابع مشابه

Learning Document Image Features With SqueezeNet Convolutional Neural Network

A Radon-based Convolutional Neural Network for Medical Image Retrieval

Generic Object Detection with Dense Neural Patterns and Regionlets

A Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images

Fully Connected Deep Structured Networks

عنوان ژورنال:

اشتراک گذاری